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i to the discrimination between transmitting and nor- 
was cru . cia Yunis and Soreng 16 first showed that caffeine, which 
( ° aI S istic effects with a variety of DNA damaging agents, 
has synerg 1 ion of common fragile sites and fragile X when 
cn hances **P ion with fl U orodeoxyuridine (FUdR). Although 
us< d ,n , c0 {i{ crs i 8 - 19 have failed to replicate caffeine's enhancing 
* e and ot x ex p r ession in lymphocytes or lymphoblasts, 
f fleet on trag increased expression in hybrid cells 6 - 7 . This 
lt has cons be due to the fact that caffeine's effect on fragile 
difference m y g ^ be indirect . it inhib j ts the mitotic delay 
\ express f ma . a2e thereby reducing the time 
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t has cons be due to the fact that caffeine's effect on fragile 
lif f C rence m y g ^ be indirect . it inhib i ts the mitotic delay 

, Associated with DNA damage, thereby reducing the time 
i\ for DNA replication, repair and chromosome con- 
iva ,labie 7 ^ effect may vary among ceI1 types w i tft different 

aensal 'l characteristics, particularly different lengths of G 2 . 
ceM ' hieh prevalence and unusual segregation patterns 
H in fragile X syndrome suggest that it is not a classical 
obS - t d mutation 3,12 * 20 - The single most unusual characteristic 
X / 1,n nheritance is its variable penetrance within pedigrees. For 
° le the penetrance of mental retardation is significantly 
offspring of mothers of transmitting males than in 
l TnrL of daughters of transmitting males 12 . This observation 

heen referred to as the 'Sherman paradox' 21 and must be 
h3S |Led by any model addressing inheritance of fragile X. 
^ unique ability of the hybrid system to induce fragile X 

ession in transmitting and normal males and to demonstrate 
C *,nntinuum of cytogenetic expression (low in normal males, 
J .her in transmitting males, highest in affected males) argues 

nclv against the involvement of autosomal modifying loci 
'this disorder. Instead, the presence of a common fragile site 
* \ q V is consistent with the multi-step mutation models of 
Pembrev et dl 20 and Nussbaum et al 22 . 

We speculate that recombination between two normal X 
chromosomes at the Xq27 common fragile site region may be 
he mechanism for an initiation event, producing a clinically 
normal transmitting male or carrier female. Segregation data 
demonstrates that progression to the full XLMR mutation occurs 
oniv in females 20 (who have two X chromosomes), also suggest- 
in(E 'that meiotic recombination is involved. These recombination 
events may produce variation in the length or copy number of 
j DNA sequence at Xq27, which at some threshold level disrupts 
the function of a neighbouring gene and produces XLMR. 
Quantitative variation in the size of the fragile site DNA region 
may also explain the variable recombination distances between 
polymorphic DNA markers observed among fragile X families . 
Thus our data suggest that a normal DNA sequence at Xq27 
may be altered to produce a continuous quantitative variation 
m fragile site DNA resulting in varying degrees of cytogenetic 
expression and a threshold for clinical manifestation of the 

mutation. ... 
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Allelic sequence variation has been analysed by synthetic 
oligonucleotide hybridization probes which can detect single base 
substitutions in human genomic DNA 1 " 3 . An allele-specific 
oligonucleotide (ASO) will only anneal to sequences that match 
it perfectly, a single mismatch being sufficient to prevent hybridiz- 
ation under appropriate conditions 4 . To improve the sensitivity, 
specificity and simplicity of this approach, we used the polymerase 
chain reaction (PCR) procedure 5 to enzymatically amplify a 
specific segment of the 0-globin or HLA-DQa gene in human 
genomic DNA before hybridization with ASOs. This in vitro 
amplification method, which produces a >10 5 -fold increase in the 
amount of target sequence, permits the analysis of allelic variation 
with as little as 1 ng of genomic DNA and the use of a simple 
'dot blot' for probe hybridization. As a further simplification, 
PCR amplification has been performed directly on crude cell 
lysates, eliminating the need for DNA purification. 

32 P-labelled ASO probes have been used in the analysis of a 
number of genetic diseases caused by single base mutations, 
including sickle cell anaemia 1 , a-1 anti-trypsin deficiency and 
/}- thalassemia 3 . Although the ASO method is a powerful diag- 
nostic tool, it requires 5 to 10 u.g of genomic DNA, gel-purified 
high specific activity probes and several days of autoradio- 
graphic exposure. The short oligomer probes (usually 19 bases 
long) also hybridize to other genomic sequences so that restric- 
tion endonuclease digestion and gel electrophoresis are required 
to separate the target sequence from the bulk of the genomic 
DNA. In order to develop a simple, sensitive method, we chose 
the sickle cell anaemia and haemoglobin C mutations in the 
sixth codon of the /3-globin gene 6 as a model system for genetic 
diagnosis. We used ASOs specific for the normal (0 A ), sickle 
cell (/3 s ), and haemoglobin C (0 C ) sequences as probes to detect 
these alleles in PCR-amplified genomic samples. Figure 1 shows 
the sequence of the PCR primers (PC03 and PC04), the ASO 
probes ( 19 A, 19S and 19C), and their relationship to the /3-globm 
gene. The PCR primers, both 20 bases in length, anneal to 
opposite strands of the genomic DNA flanking the target 
sequence 7 . These primers are positioned so that the polymerase- 
catalysed, template-directed extension product of one primer 
can serve as a template strand for the other. In this configuration, 
repeated cycles of denaturation, primer annealing, and primer 
extension, result in the exponential accumulation of a 110-base 
pair (bg^^ob^agment, with a length defined by the dist- 
ance hetwe^ : 4el:5^^Kis of the PCR primers. For example, 20 
cycles of PCR'tmplification with PC03 and PC04 as primers 
produce a 220,000-fold increase in the amount of 0-globin target 
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110 bp 



I 



5'- 



3' 



5'- 



3' 



PC03 



19A 
19S 
19C 



PC04 



PGR primers 



PC03: ACACAACTGTGTTCACTAGC 
PC04: CAACTTCATCCACGTTCACC 



ASO probes 



19C: CTCCT[AjAGGAGAAGTCTGC 
19A: CTCCTGAGGAGAAGTCTGC 
19S: CTCCTGfflGGAGAAGTCTGC 



Fig. 1 Sequence of the PCR primers and ASO probes and their 
relationship to the /3-globin gene. The two PCR primers, PC03 
and PC04, are 20 bases long and are complementary to the (-) 
and'(+) strands, respectively. The three ASO probes, 19A, 19S, 
and 19C, are 19 bases long and span codons 4-17 of the gene 8 . 
19A is complementary to the normal £-globin allele (0 A ), 19S to 
the sickle cell allele (£ s ), and 19C to the haemoglobin C allele 
(/?°). Boxes, single base mutations of /J s and 0°. 



sequence 7 . The sequences of the 19-base ASO probes used here 
are identical to those described previously by Studencki et al*. 

DNA was extracted from six blood samples from individuals 
with every possible diploid combination of the f$ A t f$ s , and p c 
alleles and from the cell line GMZ 2064, which has a homozy- 
gous deletion of the 0-globin gene 9 . The DNA (1 jig from each 
sample) was subjected to 25 cycles of PCR amplification and 
l/30th of the reaction product (33 ng) applied to a nylon filter 
as a 'dot blot*. Three replicate filters were prepared and each 
was hybridized with one of the three 32 P-labelled ASOs under 
stringent conditions. The autoradiogram (Fig. 2) clearly indi- 
cates that each ASO annealed only to those DNA samples which 
contained at least one copy of the 0-globin allele to which the 
probe was perfectly matched. For example, the 0 A -specific 
probe, 19A, hybridized only to samples AA (0 A 0 A ), AS (0 A j3 s ), 
and AC (p A p c ). The high degree of specificity contributed by 
PCR amplification is further emphasized by the lack of any 
detectable hybridization of the ASO probes to the GM2064 
deletion mutant. The frequency of the specific /?-globin target 
in PCR amplified DNA has been estimated by the analysis of 
cloned amplification products to be approximately 1%, an . 
enrichment of > 10 5 over unamplified genomic DNA 10 . It is this 
substantial reduction of complexity that allows the application 
of 19-base probes in a dot blot format and also makes possible 
the use of even shorter ASO probes capable of allelic discrimina- 
tion under less stringent conditions (data not shown). 

To estimate the minimum amount of DNA detectable, eight 
serial dilutions containing 1 to 128 ng of genomic AA DNA 
were amplified by 25 cycles and half of each sample was applied 
to a dot blot, then hybridized with the 19A probe. Equivalent 
dilutions of the 25-cycle amplifications of AA and SS DNA 
prepared in the previous experiment were included as controls. 
The autoradiogram (Fig. 3) reveals signals in all AA samples; 
the overall efficiency of amplification with nanogram quantities 
of template DNA is comparable to that obtained with 1 u,g. 
Moreover, the subnanogram detection limit obtained with 32 P- 
labelled oligonucleotide probes suggests that the analysis of 
nanogram quantities of genomic DNA with nonradioactive ASO 
probes (for example, biotin-labelled) should be feasible. Since 
one nanogram is the amount of genomic DNA present in only 
150 diploid cells (assuming a haploid genome size of 3 x 10 9 bp) 
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Fig. 2 Genotype analysis of PCR-amplified genomic DNA using 
ASO probes. DNA was extracted from the buffy coat fractions of 
clinical blood samples of known /3-gIobin genotype 11 and 1 jtg 
portions of AA (0 A /3 A ), AS(0 A 0 s ), SS (0 S 0 S ), SC (j8 s 0 s ), CC 
(^3 C /3 C ), AC (^3 A /3 C ), and GM2064 (deletion mutant, see text) 
DNA were subjected to 25 cycles of PCR amplification as described 
previously 7 . Briefly, the samples were dispensed in 100^.1 PCR 
reaction buffer (50 mM NaCl, 10 nM Tris, pH 7.6, 10 mM MgCl 2 , 
1 m-M PC03, 1 m-M PC04, 1.5 mM dATP, 1.5 mM dCTP, 1.5 mM 
dTTP, and 1.5 mM dGTP). After heating (lOmin at 95 °C) to 
denature the DNA, the samples were transferred to a 30 °C heat 
block for 2 min to allow the primers to anneal to their target 
sequences. Primer extension was initiated by addition of one unit 
of Klenow fragment of Escherichia coli polymerase I (New England 
Biolabs) and incubated for 2 min at 30 °C. This cycle of heating, 
annealing, and extension was repeated 24 times; subsequent 
denaturations were by heating for 2 min. The final volume after 
25 cycles was 150 One-thirtieth of each reaction, (5 pi, contain- 
ing 33 ng of the original genomic DNA), was adjusted to 0.4 N 
NaOH, 25 mM EDTA 17 in a 200 u,l volume and applied to a 
Genatran-45 nylon filter (Plasco) with a Bio-Dot spotting apparatus 
(Bio-Rad). Three replicate filters were prepared. The ASO probes 
were phosphorylated at their 5'-termini with [y- 32 P] ATP and poly- 
nucleotide kinase and purified by spin dialysis 11 . The specific 
activities of the probes were between 3.5 and 4.5 pCi pmol -1 . Each 
filter was prehybridized individually in 8 ml 5xSSPE, 5x 
Denhardt, 0.5% SDSfor30min at 55 °C Probed pmol) was then 
added and hybridized at 55 °C for 1 h. Filters rinsed twice in 2x 
SSPE, 0.1% SDS at room temperature, followed by a high strin- 
gency wash in 5 x SSPE, 0.1% SDS for 10 min at 55 °C (for 19C) or 
60 °C (for 19A and 19S) and autoradiographed for 2.5 h at -80 °C 
with a single intensification screen (Dupont Lightning Plus). 



we considered the possibility that a crude lysate from a small 
number of cells would contain sufficiently little cellular debris 
to allow the DNA to serve as template for the PCR amplification 
procedure. Eight dilutions containing 75 to 9,600 cells from each 
of the cell lines MOLT-4 (/3 A 0 A V\ SC-1 (/? S /3 S )'\ and GM2064 
were heated at 95 °C to lyse the ceils and subjected to 25 cycles 
of amplification without purification. Equivalent amounts of 
purified MOLT-4 DNA were included as controls. The amplified 
samples were divided onto two dot blots and hybridized with 
the 19A and 19S probes. The autoradiogram (Fig. 4) shows a 
positive signal in an unfractionated cell lysate of only 75 cells. 
The efficiency of amplification appears to be only two- to three- 
fold less than with equivalent amounts of purified DNA. 
(MOLT-4 is reported to be 90% tetraploid (A. Green, personal 
communication) so that 150 cells would be expected to contain 
2 ng of genomic DNA.) Both ASO probes retain allele 
specificity: 19A anneals only to MOLT-4 DNA and 19S only to 
SC-1 DNA. The variation from the expected two-fold reduction 
of signal intensities may reflect dilution artefacts associated with 
manigyjating such small numbers of cells, and the significant 
reduction tn signal intensity seen in both 9,600-cell samples may 
be due to inhibition of the amplification process by cellular 
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pig. 3 PCR amplification of sub-microgram amounts of genomic 
q^A. a, Eig nt seriaI dilutions containing 128, 64, 32, 16, 8, 4, 2, 
n d 1 ng of sample AA DNA were amplified by 25 cycles. One 
half of the amplification product (64 to 0.5 ng) was applied to a 
filter (row A). Controls consisted of equivalent dilutions of 6, 
previously amplified AA and c, previously amplified SS DNA (Fig. 
2} The filter was hybridized with the 19A probe and exposed for 
6 h. See Fig. 2 (legend) for methods. 



jcbris. Use of unfractionated lysate reduces the time required 
»* or analysis to <8 hours. 

pCR amplification and ASO hybridization have also been 
^cd to analyse genetic polymorphism at the HLA-DQa locus, 
tfhich has localized allelic variability in the second exon (Fig. 

Using PCR primers complementary to conserved regions 
0 ( this exon, a 242-bp fragment of DQa was amplified from 
ptf A samples isolated from nine homozygous typing cell lines 
HTC) and from two heterozygous cell lines, then hybridized 
■*-ith four different ASO probes (Figure 5b). The sequence of 
these probes was based on published data 12 " 15 as well as on our 
determinations of the sequences of complementary DNA, 
genomic and PCR-ampIified clones (ref. 12 and Fig. 5, legend), 
polymorphism in the HLA-DQ region, which consists of one a 
and one J3 chain locus, is detected serologically as the DQwl, 
DQw2, and bQw3 antigen series 16 . The DQA1 probe hybridizes 
selectively to amplified DNA from all cells typed as DQwl, 
which is associated with the DR1, DR2, and DRw6 serological 
tvpes, As expected from the gene sequence, this probe also 
hybridizes to the DNA from the DR8 homozygous cell line TAB 
which is also DQwl. The DQA3 probe hybridizes to DNA from 
the DR3, DQw2 line AVL as well as to the unknown sequence 
<n the DR5, DQw3 line JGL suggesting that these DQa alleles 
ire homologous. Since DQA3 did not hybridize to the DNA 
from the other D.Qw3 lines (DKB and KOZ), there must be 
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Fig. 4 Genotype analysis of PCR-amplified crude cell lysates 
using ASO probes. Cell concentrations of the cell lines MOLT-4 
(0 A /3 A ), SC-1 (0 S 0 S ), and GM2064 (see text) were determined 
with a haemocytometer. Eight serial dilutions in PBS were prepared 
at 9.6xl0 5 , 4.8X 10 5 , 2.4xl0 5 , 1.2x10 s , 6.0xl0 4 , 3.0xl0 4 , 1.5 x 
10 4 , and 7.5 x 10 3 cells per ml. Ten jxl of each dilution (containing 
9,600, 4,800, 2,400, 1,200, 600, 300, 150 and 75 cells), were mixed 
with 35 jil H 2 0 and heated for 10 min at 95 °C to lyse the cells. 
The lysates were added to 55 *il, 1.8 x PCR buffer (containing salts, 
primers, and dNTPs) and immediately subjected to 25 cycles of 
amplification. Equivalent amounts (64 to 0.5 ng) of purified MOLT- 
4 genomic DNA were also amplified. The PCR reaction products 
were divided onto two filters, probed with 19A and 19S and exposed 
for 6 h. Methods are as for Fig. 2 except that the amplification 
buffer included 10% DMSO (v/v), the annealing and extension 
reactions were performed at 37 °C instead of 30 °C 10 and the PCR 
amplification procedure was carried out in an automated device 
designed by the Cetus Instrumentation group. 

DQa sequence heterogeneity within this specificity. Thus the 
allelic variation in the HLA-DQa locus detected by this method 
appears to be greater than the polymorphic series defined by 
serological typing, and can also be more precisely localized. 
This approach is at present being applied to the analysis of 
polymorphism in the HLA-DQ/? and DR/3 loci. 

With the use of nonisotopic probes and PCR automation (see 
Figs 4 and 5), this procedure, combining in vitro target 
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Fig. 5 Detection of 
allelic variation in HLA- 
DQa sequences, a, Allelic 
variation in the HLA- 
DQa gene; b, detection of 
HLA-DQa alleles. Pro- 
tein sequences of the vari- 
able outer domain of the 
HLA-DQa chain, derived 
from cloned cDNA and 
PCR-amplified 10 sequen- 
ces, are shown as differen- 
ces from the sequence found in the DR1 (ref. 13) haplotype (upper line), and grouped to emphasize their similarities. The sequences from 
the DR1, DR2 (ref. 13) and DR6 (ref, 14) haplotypes (DQwl) are closely related, and those from the DR4 (ref. 14) and DR9 (ref. 16) 
haplotypes (DQw3) are identical. The region between positions 47 and 56 shows the greatest variation, and was therefore chosen as a target 
for four different ASO probes. In this segment, the DQa sequences from the DR1, 2, 6, and 8 (ref. 13) haplotypes are identical and 
homologous to the DQA1 ASO(5'-AACCTCCAAATTTGCTGAACTCAGGCCACCG-3'). The sequence from the DR7 (ref. 15) haplotype 
is homologous to the DQA7 ASO (TTCCACAGACTTAGATTTG) and the DR3 sequence (ref. 14) to the DQA3 probe 
fTGTTTGCCTGTTCTCAGAC). The sequences from the DR4 and DR9 haplotypes are identical and homogous to the DQA4 probe 
(TTCCGCAGATTTAGAAGAT). b, DNA was isolated 11 from nine homozygous typing cell lines and two heterozygous cell lines with 
HLA-DQa types DR1 (LG2), DR2 (PGF), DR3 (AVL), DR4 (DKB), DR5 (JGL), DR6 (APD), DR7 (LBF), DR8 (TAB), DR9 (KOZ), 
DR1,3 (B302), DR3,4 (B303). Each DNA sample (1 u,g) was PCR-amplified for 28 cycles with the DQa primers as described (Fig. 4). The 
degree of amplification was determined by hybridization to a DQa cDNA probe labelled with 32 P by nick-translation, and equal amounts 
of product (2-10% of the final reaction volume) was denatured with two volumes of 0.5 M NaOH, 1.5 M NaCl. After neutralization with 
0.5 M Tris, pH 7, 1.5 M NaCl to a final volume of 200 |xl, the DNA was spotted in four replicates on MSI Nylon 66 (Fischer Scientific) and 
fixed to the filters by UV irradiation for 4 min. The filters were prehybridized for 1 h at 50° in 6xSSPE, lOxDenhardt, 0.5% SDS, and 
hybridized overnight with the ASO probes (at 2 x 10 5 cpm ml -1 ) labelled with 32 P as described (Fig. 2), washed for 10 min at 37° in 0.1 x SSPE, 
0.1% SDS, and autoradiographed for 90 min with an intensification screen. 
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amplification and ASO probes in a dot-blot format, promises 
to be a general and simple method for the detection of allelic 
variation. In addition, the ability to analyse genetic variation 
rapidly in minute amounts of purified DNA or cell lysates has 
important implications for a wide variety of genetic analyses. 

We thank Corey Levenson, Lauri Goda and Dragan Spasic 
for the synthesis of oligonucleotides, Norm Arnheim for dis- 
cussions, and Larry Johnson and Richard Leath for instrument 
and engineering support. We also thank John Sninsky for critical 
review, and Kathy Levenson, Eric Ladner, Tim Culp and Sharon 
Nilson for preparation of this manuscript. 
Note added in proof : Recently, 50 ng samples of PCR-amplified 
genomic DNA have been analysed using a biotin-labelled ASO 
probe. 
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The detection of haemoglobins in distantly-related non-legume 
plant families 1 - 2 as well as in the legume families raises the question 
of whether the origin of the plant haemoglobins was single or 
multiple. We have isolated a haemoglobin gene from Parasponia 
andersonu (Ulmaceae) which shows more than 50% nucleotide 
sequence homology with the haemoglobin genes of legume plants 
and has three introns at identical positions to the leghaemoglobin 
introns*" 3 . The Parasponia gene has homology to haemoglobin 
genes in another distantly-related nodulating plant, Casuarina. It 
also hybridizes at high stringency to sequences in a related, but 
non-nodulating genus, Trtma. We conclude that the globin gene 
family may be widespread in modern plants, that plant haemo- 
globins may have a cryptic function in non-symbiotic tissue and 
that plant haemoglobins have evolved by vertical descent, probably 
from an ancestor common to modern plants and animals. 

A number of plants outside the legume families are known 
to form nitrogen-fixing root nodules in symbiotic association 
with Rhizobium 6 or actinomycetes 7 . Haemoglobin has been 
detected in nodules of several non-legume plants 1,2 and charac- 
terized from Parasponia (Ulmaceae) 8,9 and Casuarina 
(Casuarinaceae) 10 . Modern systems of plant classification show 
the known nodulating, haemoglobin-containing species to be 
widely distributed over several superorders of dicotyledons • * 
(Fig. 1). No continuous evolutionary pattern is evident. 
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Fig. 1 Phylogenetic relationship of dicotyledonous plant families 
in which some species have been shown to form nitrogen-fixing 
nodules 7 . Parasponia is the only genus of the Ulmaceae which has 
been shown to nodulate, whereas in the legume families nearly all 
genera of the Fabaceae and Mimosaceae nodulate; the Caesal- 
piniaceae are mostly non-nodulating. Families in which at least 
one species has been shown to contain nodule haemoglobin 1 * 2 are 
marked with an asterisk. Solid line, superorders according to 
Dahlgren 12 ; dotted line, according to Cronquist 11 . Note that these 
superorders also contain -150-250 (depending on the 
classification system) non-nodulating families not listed here. Iht 
N 2 -fixing symbiosis in legumes is always with Rhizobium bacteria, 
Parasponia is the only known non-legume genus to be nodulated 
by Rhizobium strains, some of which also infect legumes 6 . Other 
non-legumes undergo symbiosis with Frankia, an actinomycete\ 

Although there is 40% protein homology between soybean 
and Parasponia haemoglobin 9 , a soybean leghaemoglobin com- 
plementary DNA (cDNA) clone did not hybridize to Parasponia 
DNA or nodule RNA, even at reduced stringency. A 17mer and 
a 14mer oligonucleotide sequence deduced from the P. ander- 
sonii haemoglobin I protein sequence 9 were made (see Fig. 4). 
Both oligonucleotides hybridized to an RNA 750 bases long in 
Northern blots of Parasponia nodules (Fig. 2a). The 17mer was 
used as primer for synthesis of radioactive cDNA, which was 
used as a probe; it also hybridised to a 750-base RNA in 
Northern blots (Fig. 2b) and identified haemoglobin-specific 
clones in a nodule cDNA library. Two positive clones were 
analysed; pL209 contained the entire coding region for Para- 
sponia haemoglobin I whereas pL201 lacked the first 16 codons. 
Using pL209 as a probe, three independent genomic clones were 
isolated from a Bam HI-EMBL4 genomic library. All three 
clones contained one complete Parasponia haemoglobin gene 
on a 4.8 kilobase (kb) EcoRI end fragment within a 13-kb 
BamHl fragment. The sequences of the genomic clones, pL305 
and pL401, correspond to the cDNA clones pL209 and pL201 
respectively (Fig. 3). 

The coding region of the Parasponia haemoglobin gene spans 
486 nucleotides which translate into 161 amino acids excluding 
the N-terminal Met (Fig. 4). Predictions from the DNA sequence 
showed discrepancies with the published protein sequence . 
Resequencing of the relevant peptides confirmed an additional 
Phe at position 36, an additional Val at position 73 and a Cys 
at position 78 instead of the previously reported Thr. Four 
additional amino acids (Ser-Ser-Ser-Glu) were found at the 
N-terminus (Fig. 4). Parasponia haemoglobin has the widely 
conserved globin residues C2-Pro, CDl-Phe, proximal His (E7) 
and: distal His (F8) and, as in other plant haemoglobins, an 
extended EF-region 13 . The protein coding DNA region of the 
Parasponia haemoglobin gene shows more than 50% homology 



